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The force laws governing the extension behavior of homopolypeptides are obtained from a phe- 
nomenological free energy capable of describing the helix-coil transition. Just above the melting 
temperature of the free chains, T* , the plot of force, /, vs. end-to-end distance, R, exhibits two 
plateaus associated with coexistence of helical and coil domains. The lower plateau is due to tension 
induced onset of helix-coil transition. The higher plateau corresponds to the melting of the helices 
by overextension. Just below T* the fR plot exhibits only the upper plateau. The fR plots, the 
helical fraction, the number of domains and their polydispersity are calculated for two models: In 
one the helical domains are viewed as rigid rods while in the second they are treated as worm like 
chains. 
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I. INTRODUCTION 

Single molecule biomechanical experiments allow to 
measure the forces generated by biomolecules and their 
response to applied forces Q . For "passive" biopolymers, 
that do not produce active movement against load, the 
resulting force-extension curves provide a probe of the 
nonlinear elasticity of the chains and its relationship to 
configurational changes. Such studies were carried out 
for single stranded and double stranded DNA [^]-[|, for 
the muscle protein Titin |^^, the extracellular matrix 
protein Tenascin ||, the polysaccharides Dextran |l(]] 
and Xanthan jyj, as well as the synthetic polymer 
poly(ethylene-glycol) Theoretical modeling of the 

elastic behavior of these polymers is often hampered by 
lack of detailed knowledge concerning the configurations 
involved and the relevant interactions. A direct and de- 
tailed confrontation between experiment and theory is 
however possible for the case of homopolypeptides capa- 
ble of forming an a-helix. In this case the molecular con- 
figurations are well understood. Furthermore, one can 
relate the elastic force law to the known Zimm-Bragg pa- 
rameters that characterize the helix-coil transition of the 
free polypeptides. As we shall discuss, the extension force 
law of a long polypeptide just above the melting temper- 
ature of the helix, T* , exhibits two plateaus. Both are 
associated with a coexistence of helical and coil domains. 
The low-tension plateau is due to helix formation induced 
by the chain extension and the accompanying loss of con- 
figurational entropy. The second, high-tension plateau is 
due to the extension induced melting of the helices when 
the end-to-end length exceeds the length of the fully he- 
lical chain. For T < T* only the second plateau survives 
but the force law for weak extensions exhibits a "quasi 



plateau" i.e., a regime with a weak slope that is not as- 
sociated with a coexistence. This last regime is due to 
the facile alignment of the long persistent regions in the 
helix. In the following we analyze the force laws, their 
dependence on temperature and the corresponding pop- 
ulation distribution of helical and coil segments. While 
we are unaware of single molecule experiments on this 
system, similar behavior was observed experimentally on 
fibers of Collagen during the 1950s fDlfl . 

Two different treatments of this problem were recently 
advanced 0,^5| ■ In the present article we present a uni- 
fied analysis of the problem, tracing the physical origins 
of the disagreements between the force laws obtained in 
the two earlier publications and analyzing their range 
of validity. Our treatment is distinctive in that it is 
based on free energy argument instead of the custom- 
ary transfer matrix approach [p~6| P~8|| - This free energy 
argument allows to recover the results of the transfer ma- 
trix method but is physically transparent and relatively 
simple mathematically. Within this approach we con- 
sider the extension of a very long homopolypeptide ca- 
pable of forming a-helices. The discussion is limited to 
quasi-static extension assuming that the rate of equili- 
bration of the configurations of the chain is much faster 
than the rate of extension. Two models are explored. In 
one, the helical domains are viewed as rigid rods while 
in the second, they are modeled as semiflexible chains. 
For each model we present a rigorous derivation of the 
corresponding force law allowing for the polydispersity of 
the helical and coil domains and the associated mixing 
entropy (see Fig. |l]). In addition, we explore two useful 
approximations allowing for simplified calculations of the 
force law. In one approximation, the mixing entropy of 
the helices and domains is neglected and the plateau is 
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associated with coexistence of one helical domain with 
one coil domain (see Fig. Within this "diblock" or 
u S m ix = 0" approximation the coexistence regimes in- 
volve a first order phase transition. In spite of this erro- 
neous conclusion, this approximation correctly identifies 
the important length and force scales in the problem. 
The second, mean field approximation, neglects the di- 
rect coupling of the distribution of domains sizes and the 
applied tension. As we shall see, this approximation is 
actually exact when the helical domains are modeled as 
long semiflexible chains. 




FIG. 1. Schematic picture of the stretching of a polypep- 
tide comprizing different helical domains (thick lines) and coil 
domains (thin lines). 

The problem and the analysis are of interest from a 
number of perspectives. Homopolypeptides are the sim- 
plest representatives of helicogenic molecules. In this case 
it is not necessary to allow for the sequence heterogeneity 
which affects the elasticity of heteropolypeptides. Nor is 
it necessary to consider the long range interaction that 
occur in multiple helices. As a result, the number of pa- 
rameters invoked in the theory is smaller and the math- 
ematical analysis is simpler. The force laws obtained are 
determined by known Zimm-Bragg parameters with no 
adjustable parameters. Accordingly this system enables 
a direct comparison between experiment and theory. In 
this juncture it is important to note the availability of 
synthetic methods for producing long polypeptides with 
a well defined architecture fll9| . Furthermore, the formal- 
ism described provides a convenient basis for the analy- 
sis of more complex helicogenic polymers such as DNA 
and Collagen. From a polymer science point of view the 
elasticity of homopolypeptides is of interest because it 
reflects the effects of internal degrees of freedom arising 
from intrachain self-assembly. The study of these sys- 
tems enables the exploration of the distinctive non-linear 
force laws that are the signatures of such intrachain self- 
assembly. Finally, this elasticity plays a role in the stabi- 
lization of the helical state of grafted polypeptides [E0| . 
In turn this is of importance for the function of fuso- 
genic polypeptides and the design of surfaces that favor 
spreading and growth of cells [^f] [24) . 

The paper is organized as follows. In section II we in- 
troduce the free energy argument via an analysis of the 
helix-coil transition in free, undeformed homopolypep- 
tides. A semiquantitative analysis of the effect of ex- 
tension is presented in section III. Two approaches are 
used. One focuses on the intersections of the free energy 
curves of a pure coil and a pure helix as a function of 



the end-to-end distance, R. As we shall see, each inter- 
section is a rough diagnostic for a transition between the 
two configurations and for the occurrence of a plateau in 
the force law. The second, more quantitative approach 
is the diblock or S m i X = approximation. Within this 
approximation the homopolypeptide is assumed to con- 
sist of two domains, a helix and a coil, and the mixing 
entropy associated with the polydispersity of the helical 
and coil domains is set to zero, S m i X = 0. Finally, in sec- 
tion IV we present a rigorous analysis of the force laws 
allowing for the polydispersity of the coexisting helices 
and coil domains. As stated earlier, we implement this 
analysis for two models. In one the helical domains are 
viewed as rigid rods and in the second as semiflexible 
chains described by the worm like chain (WLC) model. 
In this section we also examine the validity of the mean 
field approximation introduced in [fL4| . Within this ap- 
proximation, the explicit coupling between the tension 
and the distribution of domain sizes is ignored. As we 
shall see, it recovers the exact result when the helices 
are semiflexible i.e., when the persistence length of the 
helices is finite and the helical domains are large. The 
analysis of the rigid helices model recovers the results 
obtained by Tamashiro and Pincus JTBfl using a transfer 
matrix formalism. A comparison between the two mod- 
els and an outline of some open problems are presented 
in the Discussion. 



II. HELIX-COIL TRANSITION 

It is helpful to first summarize the main features of the 
helix-coil transition of free polypeptides in the absence of 
imposed extension. We will consider only homopolypep- 
tides, consisting of a single type of amino acid residues, 
in order to avoid complications introduced by sequence of 
different residues found in heteropolypeptides. Our dis- 
cussion focuses on the case of long chains where the de- 
gree of polymerization of the polymer N is large, N 3> 1. 
To this end we first discuss the Zimm-Brag parameters 
and then introduce the free energy analysis of the tran- 
sition. As we shall see, this is equivalent to the transfer 
matrix formalism [B5| that is traditionally used to ana- 
lyze the transition p6|-|l8| . 



A. Zimm-Bragg parameters 

Our discussion focuses on helicogenic polypeptides 
that are capable of assuming two configurations: a ran- 
dom coil and an a-helix. In the coil state the monomers 
are free to rotate thus leading to a small Kuhn length 
with typical value of l c ~ 18A corresponding to the 
length of few monomers. For simplicity we will iden- 
tify the Kuhn length with the span of a single monomer, 
a ~ 3.8A. The helical configuration involves hydrogen 
bonds between residues i and i + 3. This constrains the 
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orientation of the three intermediate monomers. As a re- 
sult the persistence length P of the helical configuration 
is large, of order P ~ 2000A At the same time the pro- 
jected length of a monomer along the axis of the helix 
decreases to ah ~ 1.5A. The length of the polypeptide in 
a helical configuration is thus smaller than the span of a 
fully extended coil by a factor of 7 = ah /a ~ 2/5 [[Hi] [l8[ . 

The homopolypeptides are modeled as a two state sys- 
tem. Each monomer can exist either in a helical or a 
coil state. Choosing the coil state as a reference state, 
the excess free energy Af of a monomer within a helical 
domain reflects two contributions. First is the change in 
enthalpy upon formation of an intrachain iZ-bond, Ah. 
The second is associated with the loss of configurational 
entropy, As, in the helical state. Altogether 

Af ~ Ah-TAs. (1) 

The enthalpy term favors the helical state while the en- 
tropy term favors the coil state. Thus, at low tempera- 
tures the free energy Af is negative and the helical con- 
figuration is preferred while at high temperatures Af is 
positive and the coil state is dominant. The two terms 
are comparable at T* ~ Ah/ As when Af ~ 0. T* pro- 
vides an approximate value for the melting temperature 
of the a-helix. 

An additional parameter is needed to describe the 
helix-coil transition. The number of //-bonds in a he- 
lical domain consisting of n monomers is n — 2. It is thus 
necessary to allow for the special state of the terminal 
monomers of the helical domain. These two monomers 
lose their configurational entropy with no gain due to 
the formation of iZ-bonds. Each of the terminal bonds is 
thus assigned an excess free energy of 

Af t ~ -TAs (2) 

with respect to the coil state. In the following we will 
use Aft measured with respect to the helical state, that 
is 

Aft ~ -Ah. (3) 

As we shall discuss later, this definition is consistent with 
the known experimental results. Clearly, this definition 
is consistent with the traditional one when the coil state 
is used as a reference. Aft plays the role of an interfacial 
free energy associated with the boundary between helix 
and coil domains. This term makes the helix-coil tran- 
sition cooperative since it favors the formation of large 
domains. 

It is customary to describe the helix-coil transition in 
terms of the Zimm-Bragg parameters [ p6| : 

s = exp(-A//feT), (4) 
a = cxp (-2A ft/XT) (5) 

where k is the Boltzmann constant. The T dependent s 
represents the Boltzmann factor associated with adding 



one monomer to a helical domain, s ~ 1 at the transition 
temperature T* while s > 1 for T < T* and s < 1 for 
T > T*. a corresponds to the Boltzmann factor associ- 
ated with the creation of a helical domain. The definition 
of Aft as —TAs suggests that a is independent of T. The 
definition of Aft as —Ah does not predict that a is in- 
dependent of T but it yields an identical estimate for the 
numerical value of a |27| . The experimentally measured 
a, as obtained from plots of the helical content vs. T, are 
in the range of 10 — 3 — 10~ 4 , depending on the residue, 
and exhibit a weak T dependence. As stated earlier, in 
the following we will use a = exp (2Ah/kT). As we shall 
discuss, a plot of the helical content vs. s exhibits a sig- 
moid behavior and the width of the transition scales with 
TV 1 / 2 . Thus, the transition becomes sharper, more co- 
operative, as a decreases. 



B. The Helix-Coil Transition: A Free Energy 
Approach 



Having defined the relevant molecular parameters, Af 
and Aft or equivalently s and er, we are in a position 
to analyze the helix-coil transition as it occurs in free 
chains upon changing the temperature T. The mini- 
mal description of the transition requires the specifica- 
tion of two properties, the total number of monomers in 
a helical configuration, N9 and the number of helical do- 
mains, Ny. While 9 and y are sufficient for a discussion 
of the helix-coil transition of free chains, the analysis of 
the effect of extension requires additional information. 
In particular, the probability distribution of helical do- 
mains, Ph(n), and of coil domains, P c (n), comprising of 
n monomers. In order to determine these parameters we 
minimize the appropriate phenomenological free energy. 
This method recovers the results obtained by the trans- 
fer matrix approach and is easily generalized to allow for 
the effect of applied tension. 

The free energy allows for two principal contributions: 
(i) The free energy of the monomers. All NO helical 
monomers, including the terminal monomers of the do- 
mains are assigned an excess free energy of Af. The 
2Ny terminal monomers are assigned an additional free 
energy of Aft — —Ah. As a result their excess free en- 
ergy with respect to the coil state is Aft — —TAs. (ii) A 
mixing entropy term associated with the different possi- 
ble placements of N9 helical monomers and 2Ny termi- 
nal monomers constituting the domain boundaries. Since 
the helical and coil domains alternate, there is no entropy 
due to their mixing. Rather, the mixing entropy arises 
because of the polydispersity of the domains i.e., heli- 
cal and coil domains of different size n are considered 
distinguishable. Since there are Ny helical and Ny coil 
domains the mixing entropy is 
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-Nky i p h{n) In P h (n) + P c (n) lnP c (n)} . 

(6) 

The free energy per monomer in the unperturbed chain 
is thus 



F /NkT = -ehxs-yhxa 

CO co 

+y ^ P fe (n) In P fe (n) + y ^ P c (ri) In P c (n) 



(7) 



n=l 



n=l 



' °° o\ ( °° 1 - # 

2 rcP ft (r*) - - - ^ E nP » 

\n=\ y J \n=i y 



The Lagrange multipliers, ^ and \i\ , ensure the normal- 
ization Y^=\Ph{n) = 1 and ^2^—-\P c (n) — 1. The two 
remaining Lagrange multipliers, and /j,^, impose the 
average sizes kh = 9/y and k c — (1 — of the helical 
and coil domains. This description corresponds to a sim- 
plified version of the transfer matrix method involving a 
two by two matrix. In both cases there is no constraint 
ensuring that helical domains incorporate at least three 
monomers [2^] . Since we focus on the case of a <C 1 and 
N ^> 1 the weight of small domains is negligible and this 
description yields the correct results. 

The probabilities Ph{n) and P c {n) are determined by 
minimization of Fq with respect to Ph{n) and P c (n) (Ap- 
pendix A) leading to 



Ph(n) = 
Pc(n) = 



y 



y 



-y V o 

V (1-0 -y 



1 



y V i 



(8) 
(9) 



Note that Ph(n) and P c (n) decay exponentially with 
decay constants A/, = — l/ln(l — y/6) and A c = 
-l/ln[l-y/(l-0)]. Thus, P h (n) and P c (n) are not 
sharply peaked at the mean values kh = 0/y and k c = 
(1 — 0)/y. In particular, there is a significant population 
of small helices even when the average size of the heli- 
cal domain kh = 9/y is large. In this limit, — kh- 
Similarly, when k c = (1 — 0)/y is large A c ~ k c . 

Upon substituting P/ l (n) and P c (n) in Fq, as given by 
(0) , we obtain Fq as a function of and y only |2jJ : 

-0]ns -ylna + (6 -y)ln— — + yln^ (10) 



NkT 



(l-g-y)ln 1 - 6 n V +yhx- V 



1-9 

The equilibrium conditions OFq / d6 
then lead to 



and dF /dy = 



(l-6)(6-y) = 89{l-9-y), (11) 
y 2 =a(9-y)(l-0-y). (12) 



In turn, these yield (Appendix B) the expression for the 
equilibrium values of 9 and y 
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2 as [( 



2 y/( s - l) 2 +4 as 
1)2 ., ,__i-V2 



4 crs] ' 



+ 1 + ^(5-1)2+4, 



(13) 
(14) 



These expressions are identical to the ones obtained via 
the transfer matrix method mM. 
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FIG. 2. The helical fraction 9 (a) and the domain fraction 
y (b) are plotted as a function of s for long polypeptides free 
of tension. In all cases, the helix-coil transition occurs in the 
vicinity of s — 1 however, the width of the transition region 
decreases with a. 

The behavior of 9 and y as a function of s and a is 
plotted in Fig. ||. When a — 1 we recover the Boltz- 
mann distribution for independent two-level systems: 
9 = s/(l + s). In marked contrast when a = 0, the 
fraction of helical monomers is steplike: 9 = 1 for s > 1 
and 9 = for s < 1. The chain is either a pure helix or a 
pure coil. For intermediate values of a, the plot of 9 vs. 
s yields a sigmoid curve with a transition region of width 
cr 1 / 2 centered around s = 1. The number of domains is 
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maximal in the vicinity of s ~ 1 and it decreases with a. 
The parameter s = exp (— Af/kT) serves as a measure 
of the temperature, T. For low temperatures, s> 1 and 
the polymer is mainly in a helical state. At high temper- 
atures, s C 1 and the polymer is mostly in a coil state. 
The crossover occurs at T* = Ah/ As for which 8 = 1. 
Near this crossover, s — 1 ~ T* — T and the width of the 
crossover regime is AT ~ T*a x l 2 . 



III. EXTENSION BEHAVIOR: QUALITATIVE 
ANALYSIS 

The most notable feature of the force laws of ho- 
mopolypeptides are the plateaus. In this section we 
present qualitative and semiquantitative analysis of the 
plateaus. Each plateau in the force law corresponds to a 
coexistence between two states, a helix and a coil. The 
situation is somewhat analogous to a liquid-gas transi- 
tion which leads to a plateau in the pressure-volume di- 
agram. In our case, the tension / plays the role of the 
pressure and the end-to-end distance R is analogous to 
the volume. This analogy is however of limited validity 
since the homopolypeptides are one-dimensional systems 
with short range interactions and thus incapable of un- 
dergoing a first order phase transition. In such systems 
the boundary free energy is independent of the size of 
the domain. As a result, a first order phase transition 
involving a coexistence of two domains is prevented by 
the mixing entropy p0| . Accordingly, the average do- 
main sizes k c = (1 — 0)/y and kh = 0/y, as obtained in 
the previous section, are independent of the size of the 
chain, N. Nevertheless, the main features of the system 
can be recovered if one ignores the role of the mixing 
entropy As expected, within this rough approximation 
the plateaus are erroneously associated with first order 
phase transitions. In spite of this deficiency, this approx- 
imation recovers the correct length and force scales but 
the force laws obtained are wrong in detail. Before we 
present this approximation we investigate the crossing of 
the free energy curves associated with the "pure" helix 
and coil states. This is a rough diagnostic for first or- 
der phase transitions. In our case it provides an insight 
concerning the physical origin of the plateaus as well as 
a reasonable estimate for some of the length and force 
scales involved. 



A. Free Energy Curve Crossing 

The occurrence of the plateaus is signaled by the cross- 
ing of the free energy curves of the pure helix and the pure 
coil as a function of R. Each crossing point signals a co- 
existence between the two "phases" and thus a plateau 
in the force law. 

With the unperturbed coil as a reference state, the free 
energy of the coil is simply its elastic free energy. In this 



section, we will utilize the fixed R ensemble and we thus 
denote the elastic free energy by F ei (R) . To allow for the 
finite extensibility of the chain we use the freely jointed 
chain (F JC) model (Appendix C) for F e i (R) and the free 
energy per monomer is 

^S. = rx-C int (x) (15) 

where Ci n t(x) = In [sinh(a;)/x], r — R/Na is the reduced 
end-to-end distance of the polymer and x = fa/kT is 
the reduced force, r and x are related via r = C(x) = 
coth(x) — 1/x where C{x) is the Langevin function. 

The free energy of the helix reflects two contribu- 
tions. First is the excess free energy of a helix, Fh — 
NAf + 2A/ t . In the limit of an infinite chain, when the 
contribution of the terminal monomers is negligible, the 
free energy per monomer is Fh/NkT = — Ins. At this 
stage we assume that the persistence length of the helix is 
infinite and that it may be viewed as a perfectly rigid and 
unextendable rod. Within this view, extending the helix 
beyond its natural length is associated with an infinite 
free energy penalty. This penalty is important because 
the helix is shorter than the fully extended coil. As noted 
earlier, the contribution of each helical monomer to the 
overall length of the helix is ah — ja < a. Altogether, 
Fh/NkT — — Ins so long as r = R/Na < 7 while in the 
range r > 7 it is Fh/NkT = 00. 
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FIG. 3. The occurrence of plateaus in the extension force 
laws is signaled by crossing of the free energy curves corre- 
sponding to a polypeptide in a pure coil form (continuous 
curve) and in a fully helical form (dashed lines). 

Three scenarios are now possible (Fig. |J). At low tem- 
peratures, T <T* and s > 1. Accordingly, F h /NkT < 
while r < 7. In this regime the helix is more stable than 
the coil. When r > 7 the free energy of the helix diverges 
and the coil state becomes preferable. This curve cross- 
ing corresponds to a plateau involving the breakdown of 
the helix due to the applied tension. Above the melting 
temperature, for T > T* and F h /NkT = -Ins > 
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the unstretched polypeptide is in a coil state. Upon 
stretching F e i/NkT increases and eventually intersects 
with Fh/NkT. This intersection signals a plateau in 
the force law due to the formation of helical domains. 
At r = 7 the free energies F e i/NkT and Fh/NkT cross 
again because of the infinite free energy of a helix with 
r > 7. This second intersection signals a second plateau 
involving the melting of the helix and the formation of 
coil domains. Finally, for sufficiently high temperatures, 
T > T c , no intersections occur and the polypeptide re- 
mains in a coil state for all r values. The threshold tem- 
perature for this regime, T c , is defined by the equality 
F e i(Nja) — Fh where F e i(Nja) is the free energy of a 
stretched coil with r = 7. 



B. The S m ix = approximation 

The intersection of the free energy curves provides a 
rough indicator for the occurrence of plateaus in the force 
law. However, it does not allow to calculate an explicit 
force-extension diagram. A simple approach yielding the 
main features of the force law is the S m i X — or di- 
block approximation pi 31. Within this approximation 



the force law is calculated for a diblock copolymer with 
one helical domain and one coil domain. The size of the 
annealed domains varies with the extension and is deter- 
mined by the equilibrium conditions. As is implied by 
its name, in this approximation S m i x is neglected. As 
expected, setting S m i X = leads to a first order phase 
transition associated with a coexistence of two "phases" . 
Consequently, some features of the force law are wrong. 
In particular, this approximation yields a perfect plateau 
with f(R) — const' while in reality / increases weakly 
with R. These features notwithstanding, this simple ap- 
proximation does recover the correct length and force 
scales. Within this discussion we will also briefly exam- 
ine the effect of the persistence length of the helical do- 
mains. In particular we compare the behavior of a helix 
endowed with an infinite persistence length to that of a 
helix with a finite one. 

Within the S m i X — or diblock approximation the free 



energy F chain of the chain is 



chain 



= N9Af + 2Af t + F el (9, 1 ,R). 



(16) 



Here, the first term allows for the excess free energy of 
the helical monomers. Since S m i X — the helical and 
coil phases separate into two domains. The "intcrfa- 
cial" free energy of the single helical domain gives rise 
to the second, constant term. These two terms are sup- 
plemented by an elastic free energy F e i that depends on 
0, 7 and the end-to-end distance, R. Different versions 
of the S m i X = are possible, depending on the choice 
of F e i. In the following we focus on the simplest ver- 
sion where F e i is given by the Gaussian elasticity of an 
ideal random coil in the fixed R ensemble. Two addi- 
tional assumptions are invoked: (i) The helical domain is 



perfectly rigid and unextendable and (ii) the helical do- 
main is perfectly aligned with the applied force. These 
assumptions specify the elastic free energy per monomer 



F e i _ 3 (r- j&Y 
2 



NkT 



1 - 1 



(17) 



This simple version of the diblock approximation permits 
an explicit analytical solution for the force law. The cor- 
responding equilibrium condition is 



&F c hai 

d6 



= = - 



3 (r - 7 0) 5 



2 (1 - 



-37. 



'-76 1 
1-0 



Within this approximation the equilibrium condition 
d Fchain / dy = is an identity yielding no additional in- 
formation. The solution of this quadratic equation in 

X = (r-70)/(l-0) is X± = 7 (l± 

defines a critical Af 



1 



2 A/ 

3 7 2 fcT 



This 



Af c /kT c = 3 7 2 /2 



(19) 



corresponding to a critical s c = exp(— Af c /kT c ) or T c . 
While Af > A/ c , the equilibrium condition can not be 
satisfied, that is, the polypeptide remains in a coil state 
irrespective of the imposed tension. A critical point oc- 
curs for A/ = A/ c . When < A/ < A/ c two solutions 
exist, each defining a plateau in the force vs. extension 
curve. The solution A_ corresponds to the lower plateau, 
obtained for r < 7, where 



= 1 - 



1 — r/7 



y/1 - Af/Afc 



(20) 



This first regime disappears for Af < or T < T* . The 
condition 9 = defines a lower boundary of the plateau, 



= 7 1-V1-A//A/, 



(21) 



T\h increases as Af increases until r\L = 7 for Af = Af c . 
For r < r%L the fraction of helices is 9 = and the chain 
is in a purely coil form. The upper boundary of this 
plateau occurs at 



riu = 7 



(22) 



when the chain is fully helical, = 1. At intermediate 
extensions, t\l < r < r w the fraction of helical bonds 
increases as 



riL 



riu - nL 



(23) 



and 9 = 1/2 occurs at the midpoint of the plateau 
n(0 = 1/2) = (ri;/+rii)/2. The equilibrium free energy 
of the chain for r < r±L is Fchain/ NkT = 3r 2 /2 while for 
til < t < rw it is 



Rchain 

NkT ~ 



Af r - r 1L 



2 rw - til kT rw - til 



(24) 



G 



The second plateau, is described by the solution, X+, 
leading to 



= 1 - 



r/ 7 - 1 



(25) 



VI - A//A/ c 
The lower boundary of this plateau occurs at 

til = nu = 7 (26) 
when 9 = 1. The upper boundary, when 9 = 0, occurs at 

r w = 7 (l + y/l - A//A/ c ) . (27) 

For T2L < T < T2u the fraction of helical monomers de- 
creases as 



T2U 



T2U - T2L 



(28) 



and 9 = 1/2 occurs at the midpoint of the plateau 
r 2 {9 = 1/2) = (r 2 u + r 2 z,)/2. Notice that the width 
of the two plateaus shrinks as A/ — > A/ c or T — > T c . 
For r > r2(7 the fraction of monomers in a helical state 
is again 9 = and the corresponding free energy is 
Fchain/NkT = 3r 2 /2. The equilibrium free energy of 
the chain in the range r 2 L < f < ^2C/ is 



Q r 2 
°'2U 



r 2 L , A/ r 2t/ 



2 r 2 [/ - r 2L fcT r 2 u - r 2 L 



(29) 



Note that only the lower plateau is obtained if 7 = 1 is 
assumed [jl4| . 

The Smix = approximation allowed us to obtain 
rough force laws corresponding to the three scenarios 
identified by the curve crossing argument. Thus, for 
A/ > Af c or T > T c , the polypeptide remains in a coil 
configuration for the whole range of extension. The "two 
plateau scenario" occurs for < A/ < A/ c correspond- 
ing to s c < s < 1 or T* < T < T c . In this case we can 
distinguish five regimes (Fig. [|) : (I) For low extensions 
the chain is in a coil state, 9 = 0, and the elastic behav- 
ior is Gaussian leading to fa/kT = 3r. (II) In the range 
til < t < r\u helix and coil domains coexist according 
to the lever rule 9/(1 — 9) = (r — t\l)/{tiu — r ). This 
coexistence is associated with a plateau with a constant 
tension 



kT 



3r u 



(30) 



(III) At r\u = T2L — 7 the force curve exhibits a discon- 
tinuity associated with a step like increase to a second 
plateau. The magnitude of the step, 5f is 



6fa/kT=6jy/l-Af/Af c . 



(31) 



(IV) In the range r 2 L < r < r 2 j/ helix and coil do- 
mains coexist according to the lever rule 9/(1 — 9) = 
(t2U —t)/(t — T2l)- This coexistence is associated with 
a second plateau with a constant tension 



kT 



3r 2(7 . 



(32) 



(V) Finally, for r > r 2 y the chain is in a strongly 
stretched coil state obeying fa/kT = 3r. 



(I) 




(II) 



(in) 



FIG. 4. Schematic pictures of the stretching of a polypep- 
tide just above T* within the S m i X = approximation: (I) 
for small extension the chain is a weakly extended coil, (II) 
for r > r±L a helical domain coexists with a weakly extended 
coil domain, (III) at r = 7 the chain is fully helical, (IV) a 
helical domain coexists with a strongly extended coil domain 
for r > 7 and eventually, (V) for r > r-2u the chain is again 
fully in a coil state but in a highly stretched configuration. 



When A/ < 0, or T < T* , the unperturbed chain is 
in a helical state. Consequently, the lower plateau disap- 
pears. It is replaced by a "quasi plateau" which is not 
associated with a helix-coil coexistence. Rather, it is due 
to the facile orientation of the helical chain which is mod- 
eled as a rigid rod. At r = 7 a jump occurs to the upper 
plateau. 

The analysis presented above utilized the Gaussian 
elastic free energy in the fixed R ensemble to describe 
the coil. While this allows us to obtain explicit ex- 
pressions for all the characteristics of the force curve, it 
also introduces errors. Clearly, this choice of F e i does 
not capture the behavior of the coil in the high ex- 
tension regime, where finite extensibility begins to play 
a role. As a result, the values of r 2 (j are overesti- 
mated. This has two consequences. First, within this 
approximation the span of the two plateaus are equal 
T\u — til = T2U — r 2L = 7VT — A//A/c while for more 
realistic choice of F e i the higher plateau is narrower. A 
second more important pathology is the disappearance 
of regime (V) in the A/ < case. 
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FIG. 5. Plots of the reduced force x = fa/kT (a) and 
the helical fraction (b) vs. the reduced end-to-end distance, 
r = R/Na as obtained via the S m ix = approximation for 
the s c < s < 1 scenario. Both figures correspond to s — 0.9, 
7 = 2/5 and = 1000. The continuous curve corresponds 
to the case of rod like helices and Gaussian coils. The dashed 
lines depict the results obtained when the coils are described 
by the FJC model and the helices as rigid rods (short dashed 
line) or by the WLC model (long dashed line). The roman 
numerals identify the different regimes as depicted in Fig. ^. 

To allow for the finite extensibility it is necessary to 
use the freely jointed chain model (FJC) for the coil 
segment |52|. It is important to note that so long as 
7 < 0.5 and A/ > the discrepancies between the two 
schemes are rather small (Fig. ||). Another ingredient of 
our simplified analysis is the modeling of the helices as 
rigid and unextendable rods. An alternative description 
of the elastic properties of helices models them as worm 
like chains i.e., semiflexible chains capable of undergo- 
ing undulations (see Appendix D) Q. As we shall see 
later, this semiflexible approximation of the helices has 
a qualitative effect on the results of the rigorous analysis 
of the system. However, in the context of the S m i x = 
approximation for A/ > 0, it mainly affects the transi- 
tion between the two plateaus (Fig. 0). When the helix 



is modeled as a rigid rod the transition occurs as a jump 
that takes place at r = 7. On the other hand, when 
the WLC model is used the jump, while abrupt, takes 
place over a finite interval i.e., r\u — t^l is replaced by 
T\u < T2L < 7- For the A/ < (Fig. ||) the transition 
at the upper boundary of the "quasi plateau" is more 
gradual when the elasticity of the semiflexible helix is al- 
lowed for. Finally, note that the diblock approximation 
is recovered from the more rigorous treatments presented 
below when the N — ► 00 limit is taken before the a — » 
limit. When a — ► limit is approached for a finite chain 
it leads to a single domain. 



fa/kT 





FIG. 6. Same as previous figures for the s > 1 scenario. 
Both figures correspond to s = 1.1, 7 = 2/5 and n h = 1000. 



IV. RIGOROUS ANALYSIS 

The analysis presented in the previous section fails to 
allow for a number of important ingredients. The most 
important is the mixing free energy, —TS m i X , associated 
with the polydispersity of the domains. In this section we 
present an analysis that allows for the role of this mix- 
ing entropy. Because of S m i X the helix-coil transitions 
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no longer take place as first order phase transitions. In 
turn this has two important consequences: (i) The "per- 
fect" plateaus, with constant forces f\ co and /2 CO are re- 
placed by "imperfect" plateaus with a finite slope, (ii) 
The crossovers between the plateaus and the neighboring 
regimes become smoother. 

The analysis is based on a generalization of the ap- 
proach described in section II that is, all chain charac- 
teristics, Ph(n), P c (ri), 8 and y are obtained from the 
minimization of the appropriate free energy i.e., the equi- 
librium conditions. As in section III, the free energy is 
of the form 

Fchain — + F el (33) 

where, Fq is the free energy of the unperturbed free chain 
and F e i, accounts for the elasticity of the chain. However, 
in the following we utilize different expressions for these 
two terms. Since we aim to obtain Ph{n) and P c (n) it is 
necessary to utilize F as given by equation (FtI) . The han- 
dling of the elastic free energy is significantly different. 
In section III we used the Gaussian elasticity to describe 
the deformation behavior of the coils and assumed that 
the helices contribute to the elastic behavior as a geomet- 
ric constraint. While this rough description is sufficient 
at the level of the S m i X = approximation, a more re- 
fined treatment is needed for the detailed analysis to be 
undertaken below. In particular, it is necessary to allow 
for two additional features: (i) The finite extensibility of 
the chain and (ii) the gradual alignment of the helical do- 
mains with the applied tension. The freely jointed chain 
(FJC) model allows us to incorporate the finite extensi- 
bility of the coil domains. With regard to the helices we 
will explore two different descriptions that impose finite 
extensibility on the helices. First we will model the he- 
lical domains as rigid and unextendable rods and allow 
for their length dependent alignment by using the FJC 
model i.e., the helical domains are considered as freely 
jointed rigid rods forming a chain. This amounts to the 
assumption of an infinite persistence length for each he- 
lical domain. This picture is reasonable so long as the 
size of the helical domains is small compared to the per- 
sistence length, P. In the second approach, each of the 
helical domains is viewed as worm like chain (WLC) that 
is, semiflexible chain characterized by a finite persistence 
length and capable of undergoing undulations. It is as- 
sumed that the domains are large in comparison to P. 
An important difference between the FJC and the WLC 
models is that in the later the persistence length depends 
on the applied tension. Our approach utilizes the analy- 
sis of Marko and Siggia of the extension elasticity of the 
WLC jjij . The utilization of these models makes the con- 
stant / ensemble mathematically convenient. This choice 
of ensemble is also preferable because it corresponds bet- 
ter to the experimental situation encountered in single 
molecule measurements. In each case, we first obtain the 
equilibrium values of P/,(n), P c {n), 9 and y of a chain 
subject to a given tension, /, by minimizing F c hain(f) 



with respect to Ph(n), P c {n), 9 and y. This allows us to 
determine the equilibrium F c h a in(f) thus enabling us to 
calculate the fR diagram by using 

R = -dF chain (f)/df. (34) 



A. Rigid helices 

When the helical domains are viewed as rigid rods 
and the FJC is invoked, the reduced end-to-end distance 
r = R/Na of the polypeptide is 

oo oo 

r = y^2 P c (n) n C(x) + yy P h {ri) C(njx) (35) 

n—l n—l 

where x = fa/kT is the reduced tension. The first part 
of this expression corresponds to the projection of the 
coil monomers along the direction of the tension. The 
second part reflects to the contribution of the helical do- 
mains to r. Because the individual monomers within the 
coil domains are aligned independently, the domain size 
does not play a role. The extension of each coil domain 
is extensive with respect to the number of monomers, 
n. Consequently, this contribution may be expressed in 
terms of the first moment of the probability distribu- 
tion k c = (1 — 9)/y = ^ n nP c {n). As a result the full 
probability P c (n) is irrelevant. In marked contrast, the 
monomers within the helical domains are not aligned in- 
dividually but as part of the rod-like domains. The non- 
linearity introduced by the C(n"fx) term in the expression 
for r couples / and Ph(n). As a consequence, Ph{n) will 
be explicitly modified by the applied tension while P c (n) 
depends on / only implicitly, via 9 and y. 
Eliminating P c (n) from (|35|), we obtain 

oo 

r= (1 -6)C{x) + yJ2 P h (n) C(njx). (36) 

71=1 

This expression for R(f) enables us to obtain the cor- 
responding elastic free energy F e i(f) = — fl Rdf (Ap- 
pendix C) 

F i °° 

= -(1 - 0)C mt {x) - y^2 P h {n) £ lnt (njx) (37) 

n—l 

where Ci n t{x) = In [sinh(x)/x]. 

Altogether the free energy per monomer is thus 

F cham /NkT = -01n s - y In a (38) 

OO 

-(1 - 9)£ int (x) - yy P h {n) £ int (wyx) 

71=1 

oo oo 

+yY / Ph{n)\nP h (n)+yY / Pc(n)lnP c {n) 

n—l n—l 
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Since the / dependence of F c h airL is due only to F e i , (|34| ) 
yields i? as given by ([56]). As before, Ph{n) and P c (n) 
are determined by minimizing F c h a in{f) with respect to 
these probabilities subject to two constraints: (i) The 
normalization of the probabilities Pt{n) and P c (n) and 
(ii) the average sizes k c = (1 — 6)/y and fc^ = 6/y of the 
coil and helical domains. Upon substituting the result- 
ing P h {n) and P c (n) in F cham we obtain F cham (f) as a 
function of and y. 9 and j/ are then determined by the 
equilibrium conditions dF chain /d6 = dF chain /dy = 0. 
The details of the calculation involved are described in 
Appendix E. and y for a given tension / are 



cr (1 — — y) sA sinh(7:r) 
72; (1 - sAe~t x ) (1 - sAe-y x ) ' 
<7 (1-0-j/) , (l-sAe~^ 



where 



272 



A{x,0,y) 



In 



(1 



1 - sAe^ 



y)x 



(1 - 9) smh(x) ' 



(39) 
(40) 

(41) 



The probabilities P c (n) and Ph{n) are 

2/ f 1 -0-y' 



Pc(n) = 



1-0-2/ 



1 



□ / x cr „ sinh(n7x) 

fUn) = -{1-6 -y) {sA) . 

y wyx 



(42) 
(43) 



Ph{n), as P c {n), is essentially an exponentially decay- 
ing function. However, the characteristic decay constants 
are now dependent on /. The dependence is explicit for 
Ph{n) while for P c {n) the / dependence is implicit, via 9 
and y. 

In Fig. |t], the equilibrium and y, are plotted as func- 
tions of x = fa/kT for different values of a. Clearly, 
the results of the S m i X = or diblock approximation are 
approached in the limit of strong cooperativity, a — > 0. 
In particular, y approaches zero while exhibits a step 
like behavior. Thus, ~ outside the plateaus range, 
x < x\co = 0.30 and x > x^co — 2.55, while within 
the plateaus range, x\ co < x < X2co we find ~ 1 and 
y ~ 0. Consequently, the mean size of the helical do- 
mains, kh = 6/y, becomes very large compared to P. In 
turn, this suggests that the validity of the model of he- 
lical domains as rigid rods is questionable. Clearly, the 
rigid helices model is meaningful only while the persis- 
tence length is much larger than kh- 
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FIG. 7. Plots of 6 (a) and y (b) vs. fa/kT as obtained 
from the rigorous solution (dashed lines), the S m ix — or 
diblock approximation (continuous line) and the Mean Field 
approximation (dotted line) of the "rigid helices" model for 
s = 0.9 and 7 = 2/5. 



Once the equilibrium characteristics of the chain for 
a given / are found, it is possible to obtain the force- 
extension diagram. The reduced end-to-end distance, r, 
as a function of the reduced tension, x, is (Appendix E) 



r = (1 - 9)C{x) + 9-1 



coth(7x) — 



sA 



sinned;) 



y 

6~fx 



(44) 



The corresponding force-extension diagram is plotted in 
Fig. ||. Again, the results of the S m i X = or diblock ap- 
proximation are approached in the a — > limit. Note the 
"jump" in the force at R = Nja [jj5|. At this point, be- 
tween the two plateaus, the polypeptide is in a "purely" 
helical state, ~ 1 and y ~ 0. This jump is another 
indication of difficulties due to the approximation of the 
helices as rigid rods. As we shall see in the next sec- 
tion, this unphysical feature disappears when the finite 
persistence length of the helices is allowed for. 
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B. Flexible helices 



fa/kT 
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FIG. 8. The extension force law as obtained from the 
rigorous solution (dashed lines), the Smix — or diblock ap- 
proximation (continuous line) and the Mean Field approxi- 
mation (dotted line) of the "rigid helices" model for s = 0.9 
and 7 = 2/5. 



It is of interest to compare the exact solution of the 
model to the mean field approximation (MF) 015). The 
MF approximation involves the removal of the explicit 
coupling between P c (n), Ph(n) and /. As noted earlier, 
this decoupling is exact for P c (n). For Ph(n) the de- 
coupling is achieved by replacing y^2 n Ph(n)n r y£(n"fx) 
in (36) by 9jC(9jx/y). In effect, all helical domains are 
assigned an equal size of kh = 0/y and the effect of their 
polydispersity on F e i is ignored. The corresponding free 
energy is 



F 



MF 



NkT 



= —9 In s — y In a + [9 — y) In ■ 



y 



yln- + (1-9-y) In- 



1 - 1 



y m 



(45) 

y 



1 - 1 



-(1 - 9)C int (x) - y£ mt (9jx/y). 



The equilibrium conditions dF cham /d9 = dF chain /dy 
lead to 



{\-9){9-y) sx 



exp 



'yxC 



(46) 
(47) 



9(1-9- y) smh{x) 

y 2 

(6-y)(l-e-y) 

Qix \ v J iv \ y J 



The overall features of the MF solution are rather similar 
to the exact solution (Fig. ||). The two primary differ- 
ences are: (i) The MF solution indicates that the transi- 
tion between the plateaus involves a second order phase 
transition p(3| ] in marked distinction to the exact solu- 
tion, (ii) Because of (i) the plateaus do not exhibit an 
inflection point and the characteristic value of the slope, 
at r = 7, scales as a 2 [O. 



The assumption of perfectly rigid helical sequences im- 
plies an infinite persistence length. This assumption is 
reasonable so long as the size of the helical domains is 
small compared to the known values of their persistence 
length, P: A typical value is P — 2000A corresponding 
to nh — P/o-h ~ 10 3 monomers. As we have seen, the 
rigid rod approximation for an infinite chain leads to he- 
lical domains that are larger. In turn, this leads to an 
unphysical jump, located at r — 7, in the force-extension 
curve. 

Two approaches allow to circumvent this difficulty. 
One approach treats each helical monomer as an ex- 
tendable spring i.e., it introduces an elastic penalty as- 
sociated with the deviation of the length of the helical 
monomer from its equilibrium length ah- This approach 
was utilized in the analysis of the extension behavior of 
DNA [H) and of poly- (ethylene glycol) Q. In the follow- 
ing we adopt a different approach that proved successful 
in fitting the force curves of DNA. Within this approach 
the helical domains are considered as worm like chains 
(WLC) instead of rigid rods (Appendix D). 

In the WLC model the helical domains are viewed as 
semiflexible chains undergoing undulations. The angular 
correlations along the chain decay because of the ther- 
mal undulations. Within this model the decay length of 
the angular correlations, A, depends on the tension, A = 
A(/). For weakly stretched chains A is identical to the per- 
sistence length, P, of the unperturbed chain, A(/) = P 
while for stronger tensions A(/) = P(kT / f P) 1 / 2 and the 
correlations decay faster. A long helical domain of length 
rvya may thus be considered as a FJC of n , ya/2X(f) 
"rigid" segments of length 2A(/) and the reduced end- 
to-end distance is 



r = (1 - 9)C{x) + 9jC(a) 



(48) 



where C(x) is the Langevin function and a = a(nhjx) = 
2fX(f)/kT |59|. As before, the expression for r reflects 
two contributions. The first term corresponds to the 
alignment of the coil monomers with the applied ten- 
sion using the FJC model. The second term describes 
the alignment of the semiflexible helical domains. In the 
following we assume that the helical domains are larger 
than A(/) that is, A(/) <C wya. The error introduced 
by this assumption diminishes as / increases and A(/) 
decreases. Within this approximation each 2A(/) seg- 
ment in a helical domain is aligned independently. This 
is in marked contrast to the rigid helices model where all 
the monomers within a helical domain align as a unit. 
As a result we may utilize kh = 0/y = ~Y^ n nPh(n) in 
order to obtain yX^^Li Ph(n) wy C(a) = 9jC(a). Con- 
sequently, r is independent of the probabilities P c (n) 
and Ph(n). Expression ( f48| ) for R(f) enables us to ob- 
tain (Appendix D) the corresponding elastic free energy 

F el (f) = ~ Jo Rdf 
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F el /NkT = -(1 - 9)C lnt (x) - — (4n hl x 

4n h 



a) C (a) . 
(49) 



In marked distinction to the "rigid helices" scenario, this 
elastic free energy is independent of the probabilities 
Ph(n) and P c {n). Accordingly, Ph(n) and P c (n) are not 
modified explicitly by the extension and obey (^) and 
(g). In other words, the mean field approximation pro- 
vides an exact solution of the problem. Altogether, the 
free energy of a polypeptide, using the WLC model for 
the helical domains, is 



F, 



chain 



(/) 



NkT 



= — 9 In s — y In a + (9 — y) In ■ 



(50) 



+i,ln! + (l-0-y)ln- 



y _i_ „ i y 

h y In 



-(1 - 6)C mt (x) - - — [An h jx - a]C(a). 

This expression is similar to F c h a in for a free, undeformed 
chain (10). The two free energies differ in two respects: 
(i) ( |50| ) contains an additive term — Ci n t(x) which is in- 
dependent of 9 and y. (ii) The force independent s in 
( |l0| ) is replaced by a force dependent s(x) 



[£(x±) + j£(a±)] /2 and a± = a(rihjx±). The inner 
crossing point corresponds to xq for which s(x) is max- 
imal, xo satisfies the condition C(xq) — jC [a(nhjXo)] 
i.e., to an equality between the projected length of the 
coil and the helical monomers along the force. From 
and (|5l|), it is apparent that this condition is indepen- 
dent of a. The plateaus depicted in Fig. |9|b exhibit a 
small but finite slope. Its characteristic value, at the in- 
flection points x±, scales with a 1 / 2 
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s(x) 



sinh(x^ 



■ exp [(jx - a/An h ) C{a)\ 



(51) 



Consequently, the equilibrium conditions dF c h a in/d6 = 
dF c hain/dy = lead to expressions of the same form as 
those found for the free chains but with s = s(x) replac- 
ing s 



1 



s-1 



y 



2 2 7(1 
2 as [(§ 



I) 2 



1) 



Acts 



4 as 

1/2 



.5 + 1 + a/P - l) 2 +4 as 



(52) 
(53) 



Substitution of the equilibrium values of 9 and y for a 
given x in (|4^ ) yields the corresponding equilibrium end- 
to-end distance. 

Plots of 9 vs. fa/kT and fa/kT vs. R/Na, as ob- 
tained for the "semiflexible helices" scenario for different 
a, are depicted in Fig. g. The replacement of the "rigid 
helices" by semiflexible ones leads to the disappearance 
of the "jump" in the force law. Another difference with 
respect to Fig. 0a is that 9 = 1 is never attained. Again, 
we recover the appropriate diblock approximation in the 
limit of a — > 0. 

All the force-extension curves in Fig. gb cross at 
three different points. The external points correspond 
to a reduced force x± satisfying s(x±) = 1. In turn, 
this condition is equivalent to the equilibrium condition 
dF chain /d9 = for F cham corresponding to the appropri- 
ate diblock approximation in which the coil is modeled as 
FJC and the helix as a WLC j33|. In particular, x± cor- 
respond to points in the plateaus where 9 = 1/2, r± — 
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FIG. 9. Plots of the helical fraction 9 vs. the reduced 
force fa/kT (a) and plots of the reduced force fa/kT vs. 
r = R/Na (b) as obtained from the rigorous solution (dashed 
lines) and the Smix = approximation (continuous line) of 
the "semiflexible helices" model for s = 0.9, 7 = 2/5 and 
n h = 1000. 



V. DISCUSSION AND CONCLUSION 

Our analysis of the extension behavior of polypeptides 
utilized two models. The two differ in the description of 
the helical domains. In one model, the helical domains 
are viewed as perfectly rigid and unextendable rods. The 
"rigid helices" model is reasonable so long as the length of 
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the helical domains is small compared to the persistence 
length. In the second approach the helical domains are 
treated as long semiflexible polymers within the worm 
like chain (WLC) model. This approach is preferable 
when the length of the helical domains is larger than 
the force dependent decay length A(/). For each of the 
models, we presented a rigorous solution and two approx- 
imations. Within the diblock approximation one neglects 
the mixing entropy associated with the polydispersity of 
the domains. In the mean field approximation there is 
no direct coupling between the distribution of sizes of 
the domains and the applied tension. In all cases con- 
sidered, our analysis is based on the minimization of a 
phenomenological free energy. This approach is equiva- 
lent to the transfer matrix method but it is physically 
transparent and mathematically simple. 

The general features of the extension behavior of the 
polypeptides are independent of the model and the ap- 
proximation scheme. Three different scenarios are possi- 
ble for the extension of long polypeptides: (a) For T > T c 
the polypeptide remains in a coil configuration for all ex- 
tensions, (b) In the range T* < T < T c the chain is 
initially in a coil state but it undergoes a helix-coil tran- 
sition upon extension. The helical domains eventually 
melt upon stronger extension. Consequently, the force 
law exhibits two plateaus associated with a helix-coil co- 
existence, (c) For T < T* the unstretched polypeptide 
is a helix which melts upon stretching thus leading to a 
force law with a single plateau. 

In the second scenario, T* < T < T c , it is possible 
to distinguish between five regimes: (I) For weak exten- 
sions the polypeptide behaves as a random coil exhibiting 
a linear response. (II) Stronger extensions, and the asso- 
ciated loss of configurational entropy, favor the formation 
of helical domains. The associated coexistence of helical 
and coil domains gives rise to a lower plateau. (Ill) A 
steep crossover corresponding to the stretching of a stiff, 
mainly helical, polymer leads to (IV) a second, higher 
plateau due to the break-up of the helices when the im- 
posed end-to-end distance is larger then the length of a 
fully helical chain. This plateau is also associated with 
helix-coil coexistence. (V) When the helical content is 
reduced to zero the chain exhibits a strong extension be- 
havior of a random coil, with deviations from the linear 
response behavior due to finite extensibility effects. 

In the third scenario, T < T*, the regimes (I) and (II) 
disappear and regime (III) extends down to R = 0. The 
small slope of the force law in this regime (Fig. [h]) is 
reminiscent of a plateau. However, this quasi-plateau is 
not associated with a helix-coil coexistence. Rather, it is 
due to the easy alignment of the persistent segments in 
the helix. 

The two models, "rigid" and "semiflexible" helices, dif- 
fer with regard to the precise details of the force diagram. 
Thus, region (III) occurs as a "jump" within the "rigid 
helices" model but is a steep yet gradual increase in the 
"semi-flexible helices" model. The slope of the plateaus 
within the rigid helices model scales as a [In a + In | In a\] 2 



|15[ while in the "semi-flexible helices" model it scales as 
ct t ' 2 . Overall, the crossovers between the various regimes 
are sharper in the "rigid helices" model. 

The diblock approximation, with the appropriate elas- 
tic term, allows to recover the main features, length 
and force scales, of each model. However, within this 
Smix = approximation the transition takes place as a 
first order phase transition. As a result it yields perfect 
plateaus, / = f\ co or / = f2co, and sharp crossovers. 
The mean field approximation, where the polydispersity 
is decoupled from the tension is exact for the "semiflex- 
ible helices" model. In the case of the "rigid helices" 
model this approximation, while it captures the overall 
features of the force law also gives rise to a number of 
artifacts Jig ]. 
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0.2 0.4 0.6 0.8 

R/Na 

FIG. 10. Plots of the reduced force fa/kT vs. the reduced 
end-to-end distance R/Na for the s > 1 scenario (s — 1.1, 
7 = 2/5 and nn = 1000). Long dashed line: "rigid helices" 
model (a = 0.01), dashed line: "semiflexible helices" model 
(a = 0.001) and continuous line: corresponding diblock ap- 
proximations. 



Neither of the models considered captures the physics 
completely. The "rigid helices" model fails to describe 
the behavior of helical domain that are long compared 
to the persistence length. Within the "semiflexible he- 
lices" model all helical domains are assumed to be long 
in comparison to A(/). Accordingly, this model is likely 
to mishandle short helical domains that do behave as 
rigid rods. The importance of this deficiency diminishes 
however as / increases and A(/) decreases. A full de- 
scription of the system requires a model combining the 
positive features of the two models described above. 

While our analysis focused on the case of ho- 
mopolypeptides forming a -helices, our approach can be 
extended to more complicated systems. In the case of 
heteropolypeptides it is necessary to allow for the residue 
dependence of A/. For double stranded DNA it is neces- 
sary to allow for long range interactions as given by the 
Stockmayer formula for a JId|-[44[ . 
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APPENDIX A: PROBABILITY OF SIZES 

The minimization of the free energy (Q) with respect 
to the probabilities Ph(n) and P c (n) yields 



or 



y In P h (n) = n\ -y + , 
y\nP c (n) = n\-y + n^ c 2 . 



(Al) 
(A2) 



These are solved by P h {n) = A h B% and P c {n) = A C B™ 
where 



A h (c) = exp 



B h (c) = cxp 



' He) N 

Mi -y 



' He) ' 



y 



(A3) 
(A4) 



The parameters Ah, A c , Bh and B c are determined using 
the constraints 



l = J2Ph(n) = j 



A h B h 



-=Y d nP h {n) = - 
11 — . 1 



Bu 

A h B h 



71=1 



l = J2Pc(n) = - 



(l-B h ) 2 ' 

ArBr. 



71=1 



1 - 



= ]TnP c (n) = - 



-B e ' 

ArW 



(l-5c) 2: 



(A5) 
(A6) 
(A7) 
(A8) 



yielding A ft = y/(6-y), B h = (0-y)/O, A c = y/(l-6-y) 
and B c = (1 — 6 — y) / (1 — 9) thus leading to the expres- 
sions (I) for P h (n) and (|) for P c (n). 



APPENDIX B: FRACTIONS OF HELICES AND 
OF HELICAL MONOMERS 

The equilibrium conditions dF Q /d9 = dF /dy = 
with Fq given by (fLCfl) lead to 



(l-6)(9-y) = s6(l-0-y), 

y 2 =a(9-y)(l-9-y). 



From (B2), we obtain 
1 



V= T« 



y/l + 4a9(l-9) - 1 



(Bl) 
(B2) 



(B3) 



y 



2 0(l-i 



v/l + 4a0(l-0) + l 



(B4) 



where a = (1 — cr)/<r. The second condition (Bl) yields 
|(1-0)( S -1) 



y 



'(« + !) -1 



(B5) 



Combining these two equations allows us to eliminate y 
and obtain an equation for as a function of s and a, 



s/i + 4a0(l - 0) + 1 
leading to 

= iU^L 

2 



26(8 + 1) 
s - 1 



2 y/( S -l) 2 +i(T8 

2as((s~ If + Aas)- 1 ' 2 
s + l + y/(s - l) 2 + Acts ' 



(B6) 

(B7) 
(B8) 



APPENDIX C: THE FREELY JOINTED CHAIN 
MODEL 

The freely jointed chain model may be considered 
as the macromolecular analog of the Langevin theory 
of paramagnetism E^j. A flexible chain comprising N 
monomers is the counterpart of iV non interacting mag- 
netic dipoles. The monomer length a is the analog of the 
magnetic moment /x, the applied tension f plays the role 
of the magnetic field B and the induced magnetic mo- 
ment, M is the analog of the end to end distance R. Each 
monomer is assigned an orientational energy —facoscj) 
where (j> is the angle between the force and the segment. 
The reduced end-to-end distance R/Na of the polymer, 
at a temperature T, is the mean value of cos </>: 



R 

Na 



(cost 



cos t^" 58 * sin 



271 



= C(x) (CI) 



e^^sin 



where x = fa/kT is the reduced force and C(x) — 
cothx — 1/x is the Langevin function. The correspond- 
ing elastic free energy in the fixed / ensemble, as given 
by F e i(f) = — Rdf, leads to the following free energy 
per monomer: 

Fel(f) 



NkT 



(C2) 



where Ci n t(x) = In [sinh x jx\ , is the integrated Langevin 
function. 

The elastic free energy in the fixed R ensemble 
F e i(R) = / fl fdR' = Rf + F e i(f) is the Legendre trans- 
form of Fei(f) where r = R/Na and x are related by r = 
C(x). For weak extensions F e i(R) reduces to the familiar 
Gaussian form F el (R) ~ 3kTR 2 /2Na 2 while for strong 
extensions it diverges as F e i(R) ~ In / ~ — ln(l — r). 
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APPENDIX D: 



THE WORMLIKE CHAIN 
MODEL 



Within the wormlike chain model (WLC) the polymer 
is viewed as a bendable rod ptfij] . In the following we 
consider the case of a chain of constant contour length L 
and a single bending modulus E = kTP where P is the 
persistence length of the unperturbed chain. The chain 
trajectory is described by the position of a point on the 
chain, R(s), at the contour length s. The unit tangent 
vector u(s) = dR/ds specifies the local orientation of 
the chain. A straight rod corresponds to u(s) = const' 
or du/ds — 0. Accordingly, the bending energy is a 
quadratic function of du/ds. Since u ■ du/ds = 0, the 
only quadratic form is 



E 



end 



OS 



du 



(Dl) 



and the conformational distribution of the chain, its par- 
tition function, up to a contour length s, is 



^ oc exp 



P 
~2 



ds' 



du 

ds* 



(D2) 



where ^ is a function of u(0) = u and u(s) i.e, ^ — 
\l/(uo, u(s)). This describes a Gaussian process with the 
constraint u 2 = 1. As such it is analogous to rotational 
diffusion and obeys = jpW^ where 5ft = u x ^ is 
the rotational operator that is related to the angular mo- 
mentum operator in quantum mechanics, L, as — «5ft = L. 
The correlations along the unperturbed chain decay as 
(u(s) • u(0)} = exp(— s/P). When the chain is subjected 
to an external field U ex t (u) the expression for "J becomes 



W oc exp 



1 



ds' / kT 



ds'U ext (u) 



(D3) 



The effect of tension f is specified by U ex t(u) — — f ■ u 
—fcoscj) thus leading to 



? = f is 



ds \2P 



/ COS (f) 

kT 



(D4) 



It is possible to find eigenstate satisfying ^ = — gip. For 
a long chain the free energy is extensive and the eigen- 
function expansion of ^(uq, u(L)) involves terms of the 
form exp(— gL)ip(uo)i/;(u(L)) where gkT is the free en- 
ergy per unit length due to the bending fluctuations. 

An analytical solution of equation (D4) is not available. 
Marko and Siggia argued that the WLC free energy is 
determined by the lowest g in the spectrum and utilized a 
variational approach in order to determine g. The force 
law is then recovered from R/L — —kTdg/df. To this 
end they chose the trial function tp(u) oc exp [acos(</>)/2] 



where a is a variational parameter and cj) is the angle be- 
tween u and f . As we shall see later, a = 2/A/fcT where 
A = A(/) is a force dependent decay length that replaces 
P in characterizing the decay of (u(s) • u(0)). Assuming 
Jdu 2 ip 2 {u) = 1 this leads to j47| 



qa = min Ga = min ( — — — x ) C(a) 

^ a a \AP 1 



(D5) 



where C(x) = cotha; — 1/x is the Langevin function 
and x = fa/kT is the reduced force. The argument 
a = a^^jx) that minimizes g is specified by dG/da = 
which leads to: 



C{a) = [Anh'jx — a]C'{a). 



(D6) 



where £{x) = dC{x)/dx. Here n h -yx = Pf/kT. The 
end-to-end distance is simply deduced from : 



R 



-L 



dga 
dx 



dGa 
dx 



dGa da 
da dx 



LC{a). (D7) 



This end-to-end distance is identical to the one obtained 
from the FJC model if the reduced force 2ri} l r yx |39) is re- 
placed by a(rihjx). Fo r small forces, when C{a) ~ a/3 
and C(a) ~ 1/3 (D6) leads to a(nhjx) ~ 2ni l r yx or 
A ~ P. For high tension a increases more slowly. In 
this limit C(a) w 1 - 1/q while £{a) « l/a 2 and ( |D6| ) 
yields a{n h -yx) ~ (An hl x) l / 2 or A w P(kT/fP)^ 2 . The 
difference between the variational approximation and the 
exact solution of the WLC model is less than 1.5% over 
the whole range of forces. The associated elastic free 
energy, F ei (/) = - // Rdf is 



F el (f)=LkTg 



(D8) 



since Rdf = —LkTdg an d where g corresponds to the 
explicit expression in ( |D5| ) with a = a(rihjx) according 
to ( p6| ) . In Eq. ( f49| ) this elastic free energy is used for a 
helical domain of length L = n^a with n monomers. 



APPENDIX E: RIGID HELICES 
APPROXIMATION 

In this appendix we determine the probabilities P c {n) 
and Ph(n) of a coil and a helical segment with n 
monomers as well as the fractions of helical monomers 
9 and of helices y. 

The extremum condition of F c h a in ( p8| ) with respect to 
P c (n) and Ph{n) yields 

y\nP c {n) = u\ - y + n^, (El) 
y In P h {n)= n\-y + nfi% + y C mt (n-yx) . (E2) 

This leads to 



P c (n) = exp \-n — 



(E3) 



Ph[n) = exp (±11 + n a£) siDh ^), ( E4 ) 

y y J njx 
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The probability P c (n) is easily determined since it is 
not modified by the stretching. The normalization of the 



probability P c {n) and the mean size k c 
^ n nP c (n) lead to (Appendix A) 



(1 - 0)/y 



exp 



y 



y 



exp 



and: 



Pc(n) = 



1-6 -y 



1 - i 



l-O-y 



(E5) 
(E6) 

(E7) 



The probability Ph(n) is modified by the stretching 
and is more difficult to obtain. To do so we consider the 
two remaining equilibrium conditions dF c h a in/dO = 
and dFchain/ dy = after substitution of the equilibrium 
form of P c (ri) as given by (E7) 



In s = — In 
In a = In 



1 -i 

y 



f<2 



l-6-y 

A® _ 

71=1 



J2 p h(n) foP h (n) 



(E8) 
(E9) 



y 



- ^2 Ph{n)Cint{n'jx). 



Using (E4) for Ph(n) we may rewrite ( |E9| ) as 
V 



In <7 = In 



1 - i 



y 



y 



i. 



From (E10) and (E8) we obtain 

'A-V\ a(l-6-y) 



exp 



y 
(A 

exp — 
V V 



Thus leading to 



sinhx 



(E10) 

(Ell) 
(E12) 



a (1 - 9 - y) n+1 { sx ^" sinh(n 7 x) 
Pft(n)= limh^J — ^£~- (E13) 

and y are obtained using the normalization condition 
for Ph(n) and the constraint on the average size of the 
helical domains kh = 9/y 



E 

n=l 



(1 



\n+l 



sx \ n sirLh(n 7 a;) j/ 



(1 — 9) n vsinhx/ 7172: 

/ ss \™ sinh(n7a;) 



(E14) 



- (l-g_y)n + l , 

^ (1-60™ Vsi 



sinh x 



7a; 



= -. (E15) 
cr 



Using the definition sinh(a;) = (e x — e x )/2 as well as the 
relations 



n=l 
00 



-X' 



vn 



(Ei6) 

(E17) 



71=1 



we obtain 

y 1 



u , /l - s^e^ 
— m 



27a; \_ 1 — sAe^' 
6 (1 - 9 - y) sA sinh(7x) 

(7 _ 72; (1 - sAei x ) (1 - s^e"^) ' 



where 



A(a;,0,y) 



(1 



(1 - 9) sinh(.T) ' 



(E18) 
(E19) 

(E20) 



These two equations for 9 and y can be easily solved 
numerically. To this end we introduce a new variable 
e = (1-9- y)/(l - 9). Equation p!8| ) may then be 
rewritten as 



1 - e = In 

272; 



1 - sA(a?, e)e"T a: 



1 - sA(x, e)e^ x 



(E21) 



with A(x, e) = ex/ sinh(x). This equation allows to de- 
termi ne e a s a function of s, cr, 7 and x. is then obtained 
from (E1E) in the form 



aesA(x, e) sinh(7x) 



1 



7x[l - sA(x, e)e-~t x ] [1 - sA(x, e)e^ a 



(E22) 



Knowing e and 0, y is simply determined from y — 
(l-e)(l-6). 

To obtain the end-to-end distance R as a function of 
the tension / we use R = —dF c h a in/df which leads to 

00 

r = (1 -9)C(x) +y^Ph (n)rvy C(njx). (E23) 

71=1 

The second term may be tackled exactly using the defi- 
nition C(x) = cothx — 1/x: 



r = (l-9)C(x)+y^r,Ph(n)ni 



cosh(n 7 :r) y 
sinh(n7a;) x 



(1 - 9)C(x) + — (1 - 9 - y) (A+ + A_) - ^ (E24) 



2x 



with 



A. 



sA(x,9,y)e ± ^ x 
1 - sA(x, 9, y)e ± "< x ' 



(E25) 



The expression for r may be simplified using (E19) 
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A i + A_ 



2sA(x, 9, y) cosh(7x) - 2s 2 A 2 (x, 9, y) 
[1 - sA(x, 9, y) e~< x } [1 - sA(x, 9, y) e~i x } ' 
2^7x[cosh(7x) — sA(x, 9, y)} 



cr(l — 9 — y) sinh(7a;) 



(E26) 



which yields to 

r = (1 - 9)C{x) + 9 1 



coth(7x) 



sA(x,9,y) 



sinh(7a;) 



y 

X 



(E27) 
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